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Approved Research Use Statement 


Lee et al. (2018) recently reported low transethnic validity for their educational PGS. Because of this low validity, the authors concluded that, with 
respect to years of education, scores will be most useful for individuals of European descent. However, Lee et al. (2018) did not rigorously evaluate 
transethnic validity, and Lee et al.’s (2018) discussion omits important caveats; namely, the method of score construction and the criterion trait used. 
PGS created in a way that increases the ratio of tagged to causal variants will decrease transethnic validity. Moreover, a reduced trait heritability in 


one population will also decrease PGS validity. 


In the case of Lee et al. (2018), the PGS for the transethnic validity analysis was constructed using a low p-value threshold of 1. This means that the 
authors included all SNPs associated with education, regardless of the significance of the association. Using all SNPs regardless of the strength of 
the association likely reduces validity, as it increases the proportion of non-causal SNPs. Moreover, the population specific heritability of the trait 
chosen (educational attainment), given the reference sample (elder African Americans), is unknown. This renders the results of Lee et al.’s (2018; 
and other similar analyses) uninterpretable, which therefore undermines the authors’ claims. 


We plan to systematically investigate the effect of PGS method construction and criterion trait on transethnic validity using the Trajectories of 
Complex Phenotypes dataset. We will focus on two traits: educational attainment / intelligence and schizophrenia. We would like to use the full 
cohort to maximize statistical power. The study will be a correlational analysis of the statistical association between various PGSs and cognitive 
ability. We will conduct analyses separately in White and African American samples. For these analyses, we will need cognitive data (all available) to 
create general and broad ability indexes, and demographic data (age, sex, etc.). We will also need SNP genotypes to create PGS scores and to 
control for genetic relatedness following the protocol of Lee et al. (2018). 


References 


Lee, J. J., Wedow, R., Okbay, A., Kong, E., Maghzian, O., Zacher, M., ... & Fontana, M. A. (2018). Gene discovery and polygenic prediction from a 
genome-wide association study of educational attainment in 1.1 million individuals. Nature genetics, 1. 


Marigorta, U. M., & Navarro, A. (2013). High Trans-ethnic Replicability of GWAS Results Implies Common Causal Variants. PLoS Genetics, 9(6). 


Ripke, S., Neale, B. M., Corvin, A., Walters, J. T., Farh, K. H., Holmans, P.A., ... & Pers, T. H. (2014). Biological insights from 108 
schizophrenia-associated genetic loci. Nature, 511(7510), 421. 


Non-Technical Summary 


Several recent studies have constructed polygenic scores (PGS) for various traits. Examples include educational attainment (Lee et al., 2018) and 
schizophrenia (Ripke, 2014). Of significant concern is the transethnic validity of PGS (Ntzani, Liberopoulos, Manolio, & Loannidis, 2011), especially 
for relatively distant ancestry groups. This situation potentially limits the utility of PGS for epidemiological research. 


However, the method of construction of PGS is a critical factor in determining transethnic validity. PGS which capture more common causal variants 
are expected to be more generalizable than ones which capture more tag variants (Marigorta & Navarro, 2013). This is because linkage 
disequilibrium decay, which reduces predictive validity across populations, is less of a problem. We propose to research the effect of PGS 
construction on the transethnic validity of PGS for two traits; educational attainment and schizophrenia. We shall also discuss best practices for 


constructing and reporting PGS. 


Collaborators 

Change Log 

2019-12-03 04:00 Research Progress 
2019-12-03 04:00 Inappropriate Data Use 
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Research Progress 


Research Summary 


We have no new publications nor submissions from this dataset. We are waiting to hear back from NIH as to whether we are still able to use this (we 
were told not to run further analyses until we heard back). 


Scientific Presentations 
none 


Publications 
none 


Intellectual Property 


none 


Data Security 


Datasets not described in the Research Use Statement 


none 


Inappropriate Data Use 


My understanding is the DAC might be still reviewing an incident. | have not heard back from them, despite sending a follow up email (I did reply to 


the DAC's request for information a few months ago) 
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Consent Group(s) Information 


phs000607.v3.p2: 


DUC 


Af 
DAR 


Request Date . 


Name 


Consent Group # 


Abbreviation ° 


Daia Use Limitation 


This dbGaP Authorized Access Project Renewal was prepared on 2019-12-13-05:00 UTC. 


Neurodevelopmental Genomics: Trajectories of Complex Phenotypes 


see attached 


: 73948 

2018-12-10 Last Renewal Daie . 2018-12-10 
. General Research Use (NPU) 

1 

GRU-NPU 


Use of the data is limited only by the terms of the model Data Use Certification. 
Use of the data is limited to not-for-profit organizations. 


Restricted to research on genotype-phenotype associations; and molecular phenotype (e.g., gene expression; 
microRNA)-phenotype associations. Non-profit use only. 


Page: 4 


Case: 1:23-cv-00546-DAP Doc #: 38-13 Filed: 07/26/24 5 of 15. PagelD #: 1034 


DATA USE CERTIFICATION AGREEMENT 
(April 19, 2019, version) 


This Data Use Certification Agreement outlines the terms of use for requested controlled-access 
datasets maintained in NIH-designated data repositories under the NIH Genomic Data Sharing Policy 
(e.g., the NIH database of Genotypes and Phenotypes (dbGaP)). The Addendum to this Agreement 
outlines additional terms and information which are specific to each requested dataset such as: 

¢ Data Use Limitation(s) 

* Sponsoring NIH Institute or Center 

* Responsible Data Access Committee 

* Study Description 

* Suggested Acknowledgement Statement 


INTRODUCTION AND STATEMENT OF POLICY 


The National Institutes of Health (NIH) has established NIH-designated data repositories (e.g., database 
of Genotypes and Phenotypes (dbGaP), Sequence Read Archive (SRA), NIH Established Trusted 
Partnerships) for securely storing and sharing controlled-access human data submitted to NIH under the 


NIH Genomic Data Sharing (GDS) Policy. Because the volume of human genomic and phenotypic data 


maintained in these repositories is substantial and, in some instances, potentially sensitive (e.g., data 
related to the presence or risk of developing particular diseases or conditions and information regarding 
family relationships or ancestry), data must be shared in a manner consistent with the research 
participants’ informed consent, and the confidentiality of the data and the privacy of participants must 
be protected. 


Access to human genomic data will be provided to research investigators who, along with their 
institutions, have certified their agreement with the expectations and terms of access detailed below. 


NIH expects that, through Data Access Request (DAR) process, approved users of controlled-access 
datasets recognize any restrictions on data use established by the Submitting Institutions through the 
Institutional Certification, and as stated on the dbGaP study page. 


Definitions of the underlined terminology in this document are found in section 13. 


The parties to this Agreement include: the Principal Investigator (Pl) requesting access to the genomic 
study dataset (an “Approved User”), the Pl’s home institution (the “Requester”) as represented by the 
Institutional Signing Official designated through the eRA Commons system, and the NIH. The effective 
date of this Agreement shall be the DAR Approval Date, as specified in the notification of approval of the 


Data Access Committee (DAC). 
TERMS OF ACCESS 


1. Research Use 


The Requester agrees that if access is approved, (1) the Pl named in the DAR and (2) those named in the 
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“Senior/Key Person Profile” section of the DAR, including the Information Technology Director and any 
trainee, employee, or contractor? working on the proposed research project under the direct oversight 
of these individuals, shall become Approved Users of the requested dataset(s). Research use will occur 
solely in connection with the approved research project described in the DAR, which includes a 1-2 
paragraph description of the proposed research (i.e., a Research Use Statement). Investigators 
interested in using Cloud Computing for data storage and analysis must request permission to use Cloud 
Computing in the DAR and identify the Cloud Service Provider (CSP) or providers and/or Private Cloud 
System (PCS) that they propose to use. They must also submit a Cloud Computing Use Statement as part 
of the DAR that describes the type of service and how it will be used to carry out the proposed research 
as described in the Research Use Statement. If the Approved Users plan to collaborate with 
investigators outside the Requester, the investigators at each external site must submit an independent 
DAR using the same project title and Research Use Statement, and if using the cloud, Cloud Computing 
Use Statement. New uses of these data outside those described in the DAR will require submission of a 
new DAR; modifications to the research project will require submission of an amendment to this 
application (e.g., adding or deleting Requester Collaborators from the Requester, adding datasets to an 
approved project). Access to the requested dataset(s) is granted for a period of one (1) year, with the 
option to renew access or close-out a project at the end of that year. 


Submitting Investigator(s), or their collaborators, who provided the data or samples used to generate 
controlled-access datasets subject to the NIH GDS Policy and who have Institutional Review Board (IRB) 
approval and who meet any other study specific terms of access, are exempt from the limitation on the 


scope of the research use as defined in the DAR. 
2. Requester and Approved User Responsibilities 


The Requester agrees through the submission of the DAR that the PI named has reviewed and 
understands the principles for responsible research use and data management of the genomic datasets 


as defined in the NIH Security Best Practices for Controlled-Access Data Subject to the GDS Policy. The 
Requester and Approved Users further acknowledge that they are responsible for ensuring that all uses 


of the data are consistent with national, tribal, and state laws and regulations, as appropriate, as well 
as relevant institutional policies and procedures for managing sensitive genomic and phenotypic data. 
The Requester certifies that the PI is in good standing (i.e., no known sanctions) with the institution, 
relevant funding agencies, and regulatory agencies and is eligible to conduct independent research 
(i.e., is not a postdoctoral fellow, student, or trainee). The Requester and any Approved Users may use 
the dataset(s) only in accordance with the parameters described on the study page and in the 


1 If contractor services are to be utilized, Pl requesting the data must provide a brief description of the services that 
the contractor will perform for the Pl (e.g., data cleaning services) in the research use statement of the DAR. 
Additionally, the Key Personnel section of the DAR must include the name of the contractor’s employee(s) who will 
conduct the work. These requirements apply whether the contractor carries out the work at the Pl’s facility or at 
the contractor’s facility. In addition, the Pl is expected to include in any contract agreement requirements to 
ensure that any of the contractor’s employees who have access to the data adhere to the NIH GDS Policy, this Data 
Use Certification Agreement, and the NIH Security Best Practices for Controlled-Access Data Subject to the GDS 
Policy. Note that any scientific collaborators, including contractors, who are not at the Requester must submit 
their own DAR. 
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Addendum to this Agreement for the appropriate research use, as well as any limitations on such use, 
of the dataset(s), as described in the DAR, and as required by law. 

Through the submission of this DAR, the Requester and Approved Users acknowledge receiving and 
reviewing a copy of the Addendum which includes Data Use Limitation(s) for each dataset requested. 
The Requester and Approved Users agree to comply with the terms listed in the Addendum. 


Through submission of the DAR, the PI and Requester agree to submit a Project Renewal or Project 
Close-out prior to the expiration date of the one (1) year data access period. The Pl also agrees to 
submit an annual Progress Update prior to the one (1) year anniversary” of the project, as described 
under Research Use Reporting (Term 10) below. 


By approving and submitting the attached DAR, the Institutional Signing Official provides assurance that 
relevant institutional policies and applicable local, state, tribal, and federal laws and regulations, as 
applicable, have been followed, including IRB approval, if required. Approved Users may be required to 
have IRB approval if they have access to personal identifying information for research participants in 
the original study at their institution, or through their collaborators. The Institutional Signing Official 
also assures, through the approval of the DAR, that other institutional departments with relevant 
authorities (e.g., those overseeing human subjects research, information technology, technology 
transfer) have reviewed the relevant sections of the NIH GDS Policy and the associated procedures and 
are in agreement with the principles defined. 


The Requester acknowledges that controlled-access datasets subject to the NIH GDS Policy may 
be updated to exclude or include additional information. Unless otherwise indicated, all 
statements herein are presumed to be true and applicable to the access and use of all versions of 


these datasets. 


3. Public Posting of Approved Users’ Research Use Statement 


The Pl agrees that information about themselves and the approved research use will be posted publicly 
on the dbGaP website. The information includes the PI’s name and Requester, project name, Research 
Use Statement, and a Non-Technical Summary of the Research Use Statement. In addition, and if 
applicable, this information may include the Cloud Computing Use Statement and name of the CSP or 
PCS. Citations of publications resulting from the use of controlled-access datasets obtained through this 
DAR may also be posted on the dbGaP website. 


4. Non-Identification 


Approved Users agree not to use the requested datasets, either alone or in concert with any other 
information, to identify or contact individual participants from whom data and/or samples were 
collected. Approved Users also agree not to generate information (e.g., facial images or comparable 
representations) that could allow the identities of research participants to be readily ascertained. These 
provisions do not apply to research investigators operating with specific IRB approval, pursuant to 45 
CFR 46, to contact individuals within datasets or to obtain and use identifying information under an 


2 The project anniversary date can be found in “My Projects” after logging in to the dbGaP authorized-access 
portal. 
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IRB-approved research protocol. All investigators including any Approved User conducting “human 
subjects research” within the scope of 45 CFR 46 must comply with the requirements contained 


therein. 
5. Certificate of Confidentiality 


Effective June 11, 2017 the Certificate of Confidentiality (Certificate) issued for the database of 
Genotypes and Phenotypes (dbGaP) is subject to the requirements of section 301(d) of the Public Health 
Service Act (42 U.S.C. 241(d)). Moreover, as of October 1, 2017 dbGaP is required to adhere to the N/H 
Policy for Issuing Certificates of Confidentiality (NOT-OD-17-109). Therefore, Approved Users of dbGaP, 
whether or not funded by the NIH, who access a copy of information protected by a Certificate held by 
dbGaP, are also subject to the requirements of the Certificate of Confidentiality and subsection 301(d) of 
the Public Health Service Act. 


Under Section 301(d) of the Public Health Service Act and the N/H Policy for Issuing Certificates of 
Confidentiality, recipients of a Certificate of Confidentiality shall not: 


* Disclose or provide, in any Federal, State, or local civil, criminal, administrative, legislative, or 
other proceeding, the name of such individual or any such information, document, or 
biospecimen that contains identifiable, sensitive information about the individual and that was 
created or compiled for purposes of the research, unless such disclosure or use is made with the 
consent of the individual whom the information, document, or biospecimen pertains; or 

¢ Disclose or provide to any other person not connected with the research the name of such an 
individual or any information, document, or biospecimen that contains identifiable, sensitive 
information about such an individual and that was created or compiled for purposes of the 
research. 

Disclosure is permitted only when: 


* Required by Federal, State, or local laws (e.g., as required by the Federal Food, Drug, and 
Cosmetic Act, or state laws requiring the reporting of communicable diseases to State and local 
health departments), excluding instances of disclosure in any Federal, State, or local civil, 
criminal, administrative, legislative, or other proceeding; 

* Necessary for the medical treatment of the individual to whom the information, document, or 
biospecimen pertains and made with the consent of such individual; 


¢ Made with the consent of the individual to whom the information, document, or biospecimen 
pertains; or 

¢ Made for the purposes of other scientific research that is in compliance with applicable Federal 
regulations governing the protection of human subjects in research. 


6. Non-Transferability 


The Requester and Approved Users agree to retain control of NIH controlled-access datasets obtained 
through the attached DAR, and any Data Derivatives of controlled-access datasets, and further agree not 
to distribute controlled-access datasets and Data Derivatives of controlled-access datasets to any entity 
or individual not identified in the submitted DAR. If the Approved Users are provided access to 
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controlled-access datasets subject to the NIH GDS Policy for inter-institutional collaborative research 
described in the Research Use Statement of the DAR, and all members of the collaboration are also 
Approved Users through their home institution(s), data obtained through the attached DAR may be 
securely transmitted within the collaborative group. Each Approved User will follow all data security 
practices and other terms of use defined in this Agreement, the NIH Security Best Practices for 
Controlled-Access Data Subject to the GDS Policy, and the Requester’s IT security requirements and 
policies. 


The Requester and Approved Users acknowledge responsibility for ensuring the review and agreement 
to the terms within this Agreement and the appropriate research use of controlled-access data obtained 
through the attached DAR and any Data Derivatives of controlled-access datasets by research staff 
associated with any approved project, subject to applicable laws and regulations. Requester and 
Approved Users agree that controlled-access datasets obtained through the attached DAR and any Data 
Derivatives of controlled-access datasets, in whole or in part, may not be sold to any individual at any 


point in time for any purpose. 


The Pl agrees that if they change institutions during the access period they will complete the Project 
Close-out process (See Term 12 for more details) before moving to their new institution. A new DAR, in 


which the new Requester agrees to the Data Use Certification Agreement and the Genomic Data User 
Code of Conduct, must be approved by the relevant NIH DAC(s) before controlled-access data may be 


reaccessed. 


7. Data Security and Unauthorized Data Release 


The Requester and Approved Users, including the Requester’s IT Director, acknowledge NIH’s 
expectation that they have reviewed and agree to manage the requested controlled-access dataset(s) 


and any Data Derivatives of controlled-access datasets according to NIH’s expectations set forth in the 
current NIH Security Best Practices for Controlled-Access Data Subject to the GDS Policy and the 
Requester’s IT security requirements and policies. The Requester, including the Requester’s IT Director, 
agree that the Requester’s IT security requirements and policies are sufficient to protect the 
confidentiality and integrity of the NIH controlled-access data entrusted to the Requester. 


If approved by NIH to use cloud computing for the proposed research project, as outlined in the 
Research and Cloud Computing Use Statements of the Data Access Request, the Requester 
acknowledges that the IT Director has reviewed and understands the cloud computing guidelines in the 
NIH Security Best Practices for Controlled-Access Data Subject to the NIH GDS Policy. 


The Requester and Pl agree to notify the appropriate DAC(s) of any unauthorized data sharing, 
breaches of data security, or inadvertent data releases that may compromise data confidentiality 
within 24 hours of when the incident is identified. As permitted by law, notifications should include any 
known information regarding the incident and a general description of the activities or process in place 
to define and remediate the situation fully. Within 3 business days of the DAC notification, the 
Requester agrees to submit to the DAC(s) a detailed written report including the date and nature of the 
event, actions taken or to be taken to remediate the issue(s), and plans or processes developed to 
prevent further problems, including specific information on timelines anticipated for action. The 
Requester agrees to provide documentation verifying that the remediation plans have been 
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implemented. Repeated violations or unresponsiveness to NIH requests may result in further 
compliance measures affecting the Requester. 


All notifications and written reports of data security incidents and policy compliance violations should 
be sent to the DAC(s) indicated in the Addendum to this Agreement. 


NIH, or another entity designated by NIH may, as permitted by law, also investigate any data security 
incident or policy violation. Approved Users and their associates agree to support such investigations 
and provide information, within the limits of applicable local, state, tribal, and federal laws and 
regulations. In addition, Requester and Approved Users agree to work with the NIH to assure that plans 
and procedures that are developed to address identified problems are mutually acceptable and 
consistent with applicable law. 


8. Policy Compliance Violations 

The Requester and Approved Users acknowledge that the NIH may terminate the DAR, including this 
Agreement and immediately revoke or suspend access to all controlled-access datasets subject to 

the NIH GDS Policy at any time if the Requester is found to be no longer in agreement with the 
principles outlined in the NIH GDS Policy, the terms described in this Agreement, or the Genomic 

Data User Code of Conduct. The Requester and Pl agree to notify the NIH of any violations of the 

NIH GDS Policy, this Agreement, or the Genomic Data User Code of Conduct data within 24 hours of 
when the incident is identified. Repeated violations or unresponsiveness to NIH requests may result 

in further compliance measures affecting the Requester. 

The Requester and PI agree to notify the appropriate DAC(s) of any unauthorized data sharing, 
breaches of data security, or inadvertent data releases that may compromise data confidentiality 
within 24 hours of when the incident is identified. As permitted by law, notifications should include any 
known information regarding the incident and a general description of the activities or process in place 
to define and remediate the situation fully. Within 3 business days of the DAC notification(s), the 
Requester agrees to submit to the DAC(s) a detailed written report including the date and nature of the 
event, actions taken or to be taken to remediate the issue(s), and plans or processes developed to 
prevent further problems, including specific information on timelines anticipated for action. The 
Requester agrees to provide documentation verifying that the remediation plans have been 
implemented. Repeated violations or unresponsiveness to NIH requests may result in further 
compliance measures affecting the Requester. 


All notifications and written reports of data management incidents should be sent to the DAC(s) 
indicated in the Addendum to this Agreement. 


NIH, or another entity designated by NIH may, as permitted by law, also investigate any data security 
incident or policy violation. Approved Users and their associates agree to support such investigations 
and provide information, within the limits of applicable local, state, tribal, and federal laws and 
regulations. In addition, Requester and Approved Users agree to work with the NIH to assure that plans 
and procedures that are developed to address identified problems are mutually acceptable and 
consistent with applicable law. 


9. Intellectual Property 


Case: 1:23-cv-00546-DAP Doc #: 38-13 Filed: 07/26/24 11 of 15. PagelD #: 1040 


By requesting access to genomic dataset(s), the Requester and Approved Users acknowledge the intent 
of the NIH that anyone authorized for research access through the attached DAR follow the intellectual 
property (IP) principles in the NIH GDS Policy as summarized below: 


Achieving maximum public benefit is the ultimate goal of data distribution through the 
NiHdesignated data repositories. The NIH encourages broad use of NIiH-supported 
genotypephenotype data that is consistent with a responsible approach to management of 
intellectual property derived from downstream discoveries, as outlined in the NIH Best Practices 


for the Licensing of Genomic Inventions and its Research Tools Policy. 


The NIH considers these data as pre-competitive and urges Approved Users to avoid making IP 
claims derived directly from the genomic dataset(s). It is expected that these NIH-provided 
data, and conclusions derived therefrom, will remain freely available, without requirement for 
licensing. However, the NIH also recognizes the importance of the subsequent development of 
IP on downstream discoveries, especially in therapeutics, which will be necessary to support full 
investment in products to benefit the public. 


10. Dissemination of Research Findings and Acknowledgement of Controlled-Access Datasets Subject 
to the NIH GDS Policy 


It is NIH’s intent to promote the dissemination of research findings from use of controlled-access 
dataset(s) subject to the NIH GDS Policy as widely as possible through scientific publication or other 
appropriate public dissemination mechanisms. Approved Users are strongly encouraged to publish their 
results in peer-reviewed journals and to present research findings at scientific meetings. 


Approved Users agree to acknowledge the Submitting Investigator(s) who submitted data from the 
original study to an NIH-designated data repository, the primary funding organization that supported 
the Submitting Investigator(s), and the NIH-designated data repository, in all oral and written 
presentations, disclosures, and publications resulting from any analyses of controlled-access data 
obtained through the attached DAR. Approved Users further agree that the acknowledgment shall 
include the dbGaP accession number to the specific version of the dataset(s) analyzed. A sample 
acknowledgment statement is provided for each dataset in the Addendum to this Agreement. 


11. Research Use Reporting 


To assure adherence to NIH GDS Policy, the Pl agrees to provide annual Progress Updates as part of the 
annual Project Renewal or Project Close-out processes, prior to the expiration of the one (1) year data 
access period. The PI who is seeking Renewal or Close-out of a project agree to complete the 
appropriate online forms and provide specific information such as how the data have been used, 
including publications or presentations that resulted from the use of the requested dataset(s), a 
summary of any plans for future research use (if the Pl is seeking renewal), any violations of the terms of 
access described within this Agreement and the implemented remediation, and information on any 
downstream intellectual property generated from the data. The Pl also may include general comments 
regarding suggestions for improving the data access process in general. Information provided in the 
progress updates helps NIH evaluate program activities and may be considered by the NIH GDS 
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governance committees as part of NIH’s effort to provide ongoing stewardship of data sharing activities 
subject to the NIH GDS Policy. 


12. Non-Endorsement, Indemnification 


The Requester and Approved Users acknowledge that although all reasonable efforts have been taken 
to ensure the accuracy and reliability of controlled-access data obtained through the attached DAR, 
the NIH and Submitting Investigator(s) do not and cannot warrant the results that may be obtained by 
using any data included therein. NIH and all contributors to these datasets disclaim all warranties as to 


performance or fitness of the data for any particular purpose. 


No indemnification for any loss, claim, damage, or liability is intended or provided by any party under 
this agreement. Each party shall be liable for any loss, claim, damage, or liability that said party incurs as 
a result of its activities under this agreement, except that NIH, as an agency of the United States, may be 
liable only to the extent provided under the Federal Tort Claims Act, 28 USC 2671 et seq. 


13. Termination and Data Destruction 


Upon Project Close-out, the Requester and Approved Users agree to destroy all copies, versions, and 
Data Derivatives of the dataset(s) retrieved from NIH-designated controlled-access databases, on both 


local servers and hardware, and if cloud computing was used, delete the data and cloud images from 
cloud computing provider storage, virtual and physical machines, databases, and random access 
archives, in accord with the NIH Security Best Practices for Controlled-Access Data Subject to the NIH 


Genomic Data Sharing (GDS) Policy. However, the Requester may retain these data as necessary to 
comply with any institutional policies (e.g., scientific data retention policy), law, and scientific 


transparency expectations for disseminated research results, and/or journal policies. A Requester who 
retains data for any of these purposes continues to be a steward of the data and is responsible for the 
management of the retained data in accordance with the NIH Security Best Practices for 


ControlledAccess Data Subject to the NIH Genomic Data Sharing (GDS) Policy, and any institutional 


policies. Any retained data may only be used by the PI and Requester to support the findings (e.g., 
validation) resulting from the research described in the DAR that was submitted by the Requester and 
approved by NIH. The data may not be used to answer any additional research questions, even if they 
are within the scope of the approved Data Access Request, unless the Requester submits a new DAR and 
is approved by NIH to conduct the additional research. If a Requester retains data for any of these 
purposes, the relevant portions of Terms 4, 5, 6, 7, 8, and 12 remain in effect after termination of this 
Data Use Certification Agreement. These terms remain in effect until the data is destroyed. 


14. DEFINITIONS 


Approved User: A user approved by the relevant Data Access Committee(s) to access one or more 
datasets for a specified period of time and only for the purposes outlined in the Principal Investigator 


(PI)’s approved Research Use Statement. The Information Technology (IT) Director indicated on the 
Data Access Request, as well as any staff members and trainees under the direct supervision of the Pl 
are also Approved Users and must abide by the terms laid out in the Data Use Certification Agreement. 


Collaborator: An individual who is not under the direct supervision of the Pl (e.g., not a member of the 
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Pl’s laboratory) who assists with the Pl’s research project involving controlled-access data subject to the 
NIH GDS Policy. Internal collaborators are employees of the Requester and work at the same 
location/campus as the PI. External collaborators are not employees of the Requester and/or do not 
work at the same location as the PI, and consequently must be independently approved to access 
controlled-access data subject to the NIH GDS Policy. 


Cloud Computing: The National Institute for Standards and Technology defines cloud computing as a 
model for enabling ubiquitous, convenient, on-demand network access to a shared pool of configurable 
computing resources (e.g., networks, servers, storage, applications, and services) that can be rapidly 
provisioned and released with minimal management effort or service provider interaction. For more 


information see NIST Special Publication 800-145. 


Cloud Service Provider (CSP): A company or institution that offers some component of cloud computing 
to other businesses or individual, typically Infrastructure as a Service (laaS), Software as a Service (SaaS) 
or Platform as a Service (PaaS), as defined by the National Institute of Standards and Technology. For 
more information see NIST Special Publication 800-145. 

Data Access Request (DAR): A request submitted to a Data Access Committee for a specific “consent 
group” specifying the data to which access is sought, the planned research use, and the names of 
collaborators and the IT Director. The DAR is signed by the PI requesting the data and her/his 
Institutional Signing Official. Requester Collaborators and project team members on a request must be 
from the same organization. 


Data Derivative: Data derived from controlled-access datasets obtained from NIH-designated data 
repositories. Examples of derived data include imputed datasets and single nucleotide polymorphisms. 


Data Use Certification (DUC) Agreement: An agreement between the Approved User, the Requester, 
and NIH regarding the terms associated with access of controlled-access datasets subject to the NIH GDS 


Policy and the expectations for use of these datasets. 


Genomic Data User Code of Conduct: Key principles and practices agreed to by all research investigators 
requesting access to controlled-access data subject to the NIH GDS Policy. The elements within the 
Genomic Data User Code of Conduct reflect the terms of access in the Data Use Certification Agreement. 
Failure to abide by the Genomic Code of Conduct may result in revocation of an investigator’s access to 
any and all approved datasets. 


Information Technology (IT) Director: An Approved User who is generally a senior IT official of the 
Requester with the necessary expertise and authority to affirm the IT capacities at the Requester. The 
IT Director is expected to have the authority and capacity to ensure that the NIH Security Best Practices 


for Controlled-Access Data Subject to the NIH GDS Policy and the Requester’s IT security requirements 
and policies are followed by all of the Requester’s Approved Users. 


Institutional Certification: Certification by the Submitting Institution that delineates, among other 
items, the appropriate research uses of the data and the uses that are specifically excluded by the 
relevant informed consent documents. Further information may be found here. 
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Institutional Signing Official: The label, "Signing Official," is used in conjunction with the NIH eRA 
Commons and refers to the individual that has institutional authority to legally bind the institution in 
grants administration matters. The individual fulfilling this role may have any number of titles in the 
institution, but is typically located in its Office of Sponsored Research or equivalent. The Signing Official 
for the Requester reviews Data Access Request, Project Renewal, and Project Close-out applications 
submitted by Principal Investigators and legally binds the Requester to agree to adhere to the terms 
described in this Agreement if the application is submitted to NIH. The Institutional Signing Official for 
the Submitting Institution enters into the Institutional Certification and signs on behalf of the Submitting 
Investigator(s) who has submitted data. 


Principal Investigator (PI): The investigator who prepares Data Access Requests (DARs), Project 


Renewals, and Project close-outs. The Principal Investigator plays a lead role in ensuring that 
management and use of controlled-access data remains consistent with the terms in the Data Use 


Certification Agreement. To be able to submit a DAR, a Principal Investigator must be designated as such 
by their institution in eRA Commons and be a permanent employee of their institution at a level 
equivalent to a tenure-track professor or senior scientist with responsibilities that most likely include 
laboratory administration and oversight. 


Private Cloud System (PCS): A cloud infrastructure provisioned for exclusive use by a single organization 
comprising multiple consumers (e.g., business units). It may be owned, managed, and operated by the 
Requester, a third party, or some combination of them, and it may exist on or off premises. 


Progress Update: Information included with the annual Data Access Request (DAR) renewal or 
Closeout summarizing the analysis of controlled-access datasets obtained through the DAR and any 
publications and presentations derived from the work. 

Project Close-out: Termination of a research project that used controlled-access data from an 
NiHdesignated data repository (e.g., dbGaP) and confirmation of data destruction when the research is 
completed and/or discontinued. The project close-out process is completed in the dbGaP Authorized 


Access System. 


Project Renewal: Renewal of a Pl’s access to controlled-access datasets for a previously-approved 
project. 


Requester: The home institution or organization of the Approved User that applies to dbGaP for access 
to controlled-access data subject to the NIH GDS Policy. 
Submitting Institution: An organization who submitted a genomic dataset to an NIH-designated data 


repository (e.g., dbGaP). 


Submitting Investigator: An investigator who submitted a genomic dataset to an NIH designated data 
repository (e.g., dbGaP). 
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Study specific DUC addendum 


phs000607 : 


Neurodevelopmental Genomics: Trajectories of Complex Phenotypes 


Public Posting of Genomic Summary Results - Allowed. 


NIH Data Access Committee (DAC) : 


Important Contacts 


Joint Addiction, Aging, and Mental Health DAC 
JAAMHDAC@mail.nih.gov ; URGENTJAAMHDAC@mailnih.gaov; GDS@mail.nih.gov 


In the event of a data management incident, within 24 hours, please contact emails above. 


Acknowledgement Statement : 


Support for the collection of the data for Philadelphia Neurodevelopment Cohort (PNC) was provided by grant 
RC2MH089983 awarded to Raquel Gur and RC2MH089924 awarded to Hakon Hakonarson. Subjects were 
recruited and genotyped through the Center for Applied Genomics (CAG) at The Children's Hospital in 
Philadelphia (CHOP). Phenotypic data collection occurred at the CAG/CHOP and at the Brain Behavior 
Laboratory, University of Pennsylvania. 


Drs. Gur, Hakonarson, and collaborators request that publications resulting from publically accessed PNC data 
cite the following references according to the data types used. PNC references are listed below and are grouped 
on the basis of data type. 


Clinical Phenotypes 

Demographic, medical, and psychopathology history were assessed using a structured computerized instrument, 
named GOASSESS. Accordingly, for any studies that make use of clinical data, it is appropriate to cite the 
following two references: 

¢ Calkins ME, Moore TM, Merikangas KR, et al. The psychosis spectrum in a young U.S. community sample: 
findings from the Philadelphia Neurodevelopmental Cohort. World Psychiatry. 2014;13(3):296-305. 
doi:10.1002/wps.20152. https:/Avww.ncbi.nlm.nih.gov/pmc/articles/PMC4219071 

¢ Calkins ME, Merikangas KR, Moore TM, et al. The Philadelphia Neurodevelopmental Cohort: constructing a 
deep phenotyping collaborative. Journal of child psychology and psychiatry, and allied disciplines. 
2015;56(12):1356-1369. doi:10.111 1/jcpp.12416. https:/Avww.ncbi.nim.nih.gov/pmc/articles/PMC4598260/ 


Cognitive/CNB Phenotypes 


Cognitive assessment was performed using the Penn Computerized Neurocognitive Battery (CNB), which was 
customized and shortened for a pediatric population using both classical psychometric and item response theory 
methods. Studies that make use of the cognitive data should reference: 

¢ Gur RC, Calkins ME, Satterthwaite TD, et al. Neurocognitive Growth Charting in Psychosis Spectrum Youths. 
JAMA Psychiatry. 2014;71(4):366374. doi:10.1001/jamapsychiatry.2013.4190 
https://jamanetwork.com/journals/jamapsychiatry/fullarticle/1819579 


Neuroimaging Phenotypes 

The magnetic resonance imaging (MRI) protocol was comprised of scans designed to obtain information on brain 
structure, perfusion, structural connectivity, resting state functional connectivity, working memory function, and 
emotion identification. The following reference should be used to cite neuroimaging data from the PNC: 

* Satterthwaite TD, Elliott MA, Ruparel K, et al. Neuroimaging of the Philadelphia Neurodevelopmental Cohort. 
Neurolmage. 2014;86:544-553. doi:10.1016/j.neuroimage.2013.07.064. 
https:/Awww.ncbi.nim.nih.gov/pmc/articles/PMC3947233/ 


Genotyping Data 

The samples released were all previously genotyped using Illumina or Affymetrix SNP-array platform by the 
Center for Applied Genomics at The Children' s Hospital of Philadelphia. The following reference should be used 
to cite use of genotyping data from the PNC: 


¢ Joseph T Glessner, Muredach P Reilly, Cecilia E Kim, et al. Strong synaptic transmission impact by copy 
number variations in schizophrenia. PNAS, 2010. 107 (23) 10584-10589; 
https://doi.org/10.1073/pnas.1000274107 


General Research Use (NPU) 
1 
GRU-NPU 


Use of the data is limited only by the terms of the model Data Use Certification. 
Use of the data is limited to not-for-profit organizations. 


Restricted to research on genotype-phenotype associations; and molecular phenotype (e.g., gene expression; 
microRNA)-phenotype associations. Non-profit use only. 


